Speaker Identification Method Using Earth Mover's Distance for CCC Speaker Recognition Evaluation 2006

نویسندگان

  • Shingo Kuroiwa
  • Satoru Tsuge
  • Masahiko Kita
  • Fuji Ren
چکیده

In this paper, we present a non-parametric speaker identification method using Earth Mover’s Distance (EMD) designed for text-indepedent speaker identification and its evaluation results for CCC Speaker Recognition Evaluation 2006, organized by the Chinese Corpus Consortium (CCC) for the th International Symposium on Chinese Spoken Language Processing (ISCSLP 2006). EMD based speaker identification (EMD-IR) was originally designed to be applied to a distributed speaker identification system, in which the feature vectors are compressed by vector quantization at a terminal and sent to a server that executes a pattern matching process. In this structure, we had to train speaker models using quantized data, then we utilized a non-parametric speaker model and EMD. From the experimental results on a Japanese speech corpus, EMD-IR showed higher robustness to the quantized data than the conventional GMM technique. Moreover, it achieved higher accuracy than GMM even if the data was not quantized. Hence, we have taken the challenge of CCC Speaker Recognition Evaluation 2006 using EMD-IR. Since the identification tasks defined in the evaluation were on an open-set basis, we introduce a new speaker verification module. Evaluation results show that EMD-IR achieves 99.3 % Identification Correctness Rate in a closed-channel speaker identification task.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of EMD-Based Speaker Recognition Using ISCSLP2006 Chinese Speaker Recognition Evaluation Corpus

In this paper, we present the evaluation results of our proposed text-independent speaker recognition method based on the Earth Mover’s Distance (EMD) using ISCSLP2006 Chinese speaker recognition evaluation corpus developed by the Chinese Corpus Consortium (CCC). The EMD based speaker recognition (EMD-SR) was originally designed to apply to a distributed speaker identification system, in which ...

متن کامل

Distributed speaker recognition using earth mover's distance

In this paper, we focus on distributed speaker recognition, a technique in which quantized feature parameters are sent to a server, as with distributed speech recognition. The Gaussian mixture model , the traditional method used for speaker recognition, is trained using the maximum likelihood approach. The GMM has output probability functions with continuous density functions. It is difficult t...

متن کامل

CCC Speaker Recognition Evaluation 2006: Overview, Methods, Data, Results and Perspective

For the special session on speaker recognition of the 5th International Symposium on Chinese Spoken Language Processing (ISCSLP 2006), the Chinese Corpus Consortium (CCC), the session organizer, developed a speaker recognition evaluation (SRE) to act as a platform for developers in this field to evaluate their speaker recognition systems using two databases provided by the CCC. In this paper, t...

متن کامل

A Comparative Analysis of Speaker Identification on English and Hindi Database

In this paper a text-dependent speaker recognition method is presented by combining Mel frequency cepstrum coefficients (MFCC) and Euclidean distance. The robustness of this speaker identification method for different speaking language is analyzed in this paper. The speaker identification algorithm using English and Hindi Indian voice database (IVD) which contains sentences of data spoken is ac...

متن کامل

Speaker recognition with penalized logistic regression machines

「罰金付きロジスティック回帰マシンを用いた話者認識」, ビルケネス・オイスティン(ノル ウェー工科大学),松井知子(統数研) Abstract We study on speaker recognition using a penalized logistic regression machine (PLRM) [1-3]. Parameters of a multiclass logistic regression model with the log-likelihood values of speaker Gaussian mixture models (GMMs) are discriminatively estimated and the model used for speaker decision. In speaker identification experimen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IJCLCLP

دوره 12  شماره 

صفحات  -

تاریخ انتشار 2007